Ultrafast shape recognition to search compound databases for similar molecular shapes

Authors

  • Pedro J. Ballester
  • W. Graham Richards
Abstract

Finding a set of molecules that closely resemble a given lead molecule in a database containing potentially billions of chemical structures is an important but daunting problem. Similar molecular shapes are particularly important, given that in biology small organic molecules frequently act by binding into a defined and complex site on a macromolecule. Here, we present a new method for molecular shape comparison, named ultrafast shape recognition (USR), capable of screening billions of compounds for similar shapes using a single computer and without the need to align the molecules before testing for similarity. Despite its extremely fast comparison rate, USR will be shown to be highly accurate at describing, and hence comparing, molecular shapes.
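
The abstract does not spell out how shapes are compared without alignment. The following is a minimal sketch of the idea as it is commonly described for USR: each molecule is reduced to 12 numbers, the first three moments of the atomic distance distributions measured from four reference points, and two molecules are scored by an inverse Manhattan distance between those vectors. The function names (`usr_descriptor`, `usr_similarity`), the choice of mean, variance, and third central moment, and the toy coordinates are assumptions made for illustration; moment conventions vary between implementations, and this is not the authors' code.

```python
import math


def _moments(dists):
    """Return mean, variance and third central moment of a distance list."""
    n = len(dists)
    mean = sum(dists) / n
    var = sum((d - mean) ** 2 for d in dists) / n
    third = sum((d - mean) ** 3 for d in dists) / n
    return [mean, var, third]


def usr_descriptor(coords):
    """Build a 12-number shape descriptor from a list of (x, y, z) atoms.

    Reference points (assumed, following the usual description of USR):
      ctd: centroid, cst: atom closest to ctd,
      fct: atom farthest from ctd, ftf: atom farthest from fct.
    """
    n = len(coords)
    ctd = tuple(sum(c[k] for c in coords) / n for k in range(3))
    d_ctd = [math.dist(c, ctd) for c in coords]
    cst = coords[d_ctd.index(min(d_ctd))]
    fct = coords[d_ctd.index(max(d_ctd))]
    d_fct = [math.dist(c, fct) for c in coords]
    ftf = coords[d_fct.index(max(d_fct))]
    d_cst = [math.dist(c, cst) for c in coords]
    d_ftf = [math.dist(c, ftf) for c in coords]

    descriptor = []
    for dists in (d_ctd, d_cst, d_fct, d_ftf):
        descriptor.extend(_moments(dists))
    return descriptor


def usr_similarity(a, b):
    """Score in (0, 1]: 1 / (1 + mean absolute descriptor difference)."""
    diff = sum(abs(x - y) for x, y in zip(a, b)) / len(a)
    return 1.0 / (1.0 + diff)


# Hypothetical five-atom toy geometry, not a real molecule.
query = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (2.1, 1.2, 0.0),
         (3.4, 1.1, 0.7), (0.2, -1.3, 0.4)]
# The same shape after a rigid rotation about z plus a translation.
moved = [(-y + 5.0, x - 2.0, z + 1.0) for x, y, z in query]
score = usr_similarity(usr_descriptor(query), usr_descriptor(moved))
```

Because the descriptor is built only from interatomic distances, rigidly rotating or translating a molecule leaves it unchanged up to rounding, which is why no alignment step is needed in this sketch. Comparing two molecules then costs only 12 subtractions, which is consistent with the screening speed the abstract claims.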

Similar resources

Ultrafast shape recognition for similarity search in molecular databases

Molecular databases are routinely screened for compounds that most closely resemble a molecule of known biological activity to provide novel drug leads. It is widely believed that three-dimensional molecular shape is the most discriminating pattern for biological activity as it is directly related to the steep repulsive part of the interaction potential between the drug-like molecule and its ma...

Otto-von-Guericke-Universität Magdeburg Diplomarbeit: Structural Deformable Models for Robust Object Recognition

A hierarchical framework for the recognition of compound deformable shapes is developed. In extension to traditional approaches an additional layer of control is introduced to guide the local search for shapes. This is realized by incorporating knowledge about their spatial relationships. A new technique of expectation maps is applied to allow parallel shape searches to inspire each other. Furt...

Normalized rotation shape descriptors and lossy compression of molecular shape

When searching for new drugs, there is a common need to search molecular databases for compounds resembling a given shape, since a similar shape suggests similar biological activity. The large size of the databases requires fast methods for such initial screening, for example methods based on feature vectors constructed so that similar molecules correspond to close vectors. Ultrafa...

USRCAT: real-time ultrafast shape recognition with pharmacophoric constraints

BACKGROUND: Ligand-based virtual screening using molecular shape is an important tool for researchers who wish to find novel chemical scaffolds in compound libraries. The Ultrafast Shape Recognition (USR) algorithm is capable of screening millions of compounds and is therefore suitable for usage in a web service. The algorithm however is agnostic of atom types and cannot discrimina...

Search Space Reduction for Farsi Printed Subwords Recognition by Position of the Points and Signs

In the field of word recognition, three approaches are used: word isolation, overall shape, and a combination of the two. Most optical recognition methods recognize a word by breaking it into its letters and then recognizing them. This approach faces problems because of the difficulty of isolating letters and its low recognition accuracy in texts with low image quality. Therefo...

Journal title:
  • Journal of Computational Chemistry

Volume 28, Issue 10

Pages -

Publication date 2007